Average sentence length |
---|
18.1070 |
sentence length | percentage |
---|---|
2 | 0.0180 |
3 | 0.3290 |
4 | 1.1290 |
5 | 1.7650 |
6 | 2.4300 |
7 | 3.0910 |
8 | 3.7060 |
9 | 4.1240 |
10 | 4.3460 |
11 | 4.5530 |
12 | 4.5890 |
13 | 4.7610 |
14 | 4.6050 |
15 | 4.6130 |
16 | 4.4290 |
17 | 4.3120 |
18 | 4.2250 |
19 | 3.8860 |
20 | 3.8170 |
21 | 3.4660 |
22 | 3.2900 |
23 | 2.9160 |
24 | 2.6860 |
25 | 2.6230 |
26 | 2.4430 |
27 | 2.3060 |
28 | 2.0150 |
29 | 1.8130 |
30 | 1.6900 |
31 | 1.5310 |
32 | 1.3480 |
33 | 1.2530 |
34 | 1.1190 |
35 | 0.9330 |
36 | 0.8580 |
37 | 0.7220 |
38 | 0.6380 |
39 | 0.4670 |
40 | 0.3590 |
41 | 0.2720 |
42 | 0.1720 |
43 | 0.1460 |
44 | 0.0820 |
45 | 0.0590 |
46 | 0.0260 |
47 | 0.0190 |
48 | 0.0140 |
49 | 0.0030 |
50 | 0.0030 |
Next we measure the length of a sentence by its number of words. Again, we are interested in average length and length distribution.
See above.
For simplicity, the number of words in a sentence is calculated as 1+(Number of blanks in the sentence).
Average length:
select avg(1+char_length(sentence)-char_length( replace(sentence," ",""))) from sentences;
Distribution:
SELECT @all:=count(*) from sentences;
select 1+char_length(sentence)-char_length( replace(sentence," ","")), 100*count(*)/@all from sentences where 50>=1+char_length(sentence)-char_length(replace(sentence," ",""))group by 1+char_length(sentence)-char_length( replace( sentence," ",""));
4.2.1 Length of sentences in characters